A Full-Text Learning to Rank Dataset for Medical Information Retrieval
نویسندگان
چکیده
I let q ∈ {0, 1} and d ∈ {0, 1} be query and document vectors, dimensions indicating word occurrence for dictionaries of size Q and D I score function f(q,d) = qWd = ∑Q i=1 ∑D j=1 qiWijdj , where W ∈ RQ×D is a matrix of word associations between query and document dictionaries I R is a set of tuples (q,d+,d−), document d being more relevant for query q than d− I relevance rank rq,d, rank di erence m(q,d+,d−) = rq,d+ − rq,d− I RankBoost: Lexp = ∑ (q,d+,d−)∈R m(q,d+,d−)ef(q,d +)−f(q,d−)
منابع مشابه
ارائه الگوریتمی مبتنی بر یادگیری جمعی به منظور یادگیری رتبهبندی در بازیابی اطلاعات
Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank has been shown to be useful in many applications of information retrieval, natural language processing, and data mining. Learning to rank can be described by two systems: a learning system and a ranking system. The learning system takes training data as input and constructs a ranking ...
متن کاملEffective Learning to Rank Persian Web Content
Persian language is one of the most widely used languages in the Web environment. Hence, the Persian Web includes invaluable information that is required to be retrieved effectively. Similar to other languages, ranking algorithms for the Persian Web content, deal with different challenges, such as applicability issues in real-world situations as well as the lack of user modeling. CF-Rank, as a ...
متن کاملDeep Neural Network for Learning to Rank Query-Text Pairs
This paper considers the problem of document ranking in information retrieval systems by Learning to Rank. We propose ConvRankNet combining a Siamese Convolutional Neural Network encoder and the RankNet ranking model which could be trained in an end-to-end fashion. We prove a general result justifying the linear test-time complexity of pairwise Learning to Rank approach. Experiments on the OHSU...
متن کاملLearning to Rank Semantic Coherence for Topic Segmentation
Topic segmentation plays an important role for discourse parsing and information retrieval. Due to the absence of training data, previous work mainly adopts unsupervised methods to rank semantic coherence between paragraphs for topic segmentation. In this paper, we present an intuitive and simple idea to automatically create a “quasi” training dataset, which includes a large amount of text pair...
متن کاملLETOR: A Benchmark Collection LETOR: A Benchmark Collection for Learning to Rank for Information Retrieval
Learning to rank has attracted great attention recently in both information retrieval and machine learning communities. However, the lack of public dataset had stood in its way until the LETOR benchmark dataset (actually a group of three datasets) was released in the SIGIR 2007 workshop on Learning to Rank for Information Retrieval (LR4IR 2007). Since then, this dataset has been widely used in ...
متن کامل